Monaural Speech Separation Based on Gain Adapted Minimum Mean Square Error Estimation

نویسندگان

Mohammad H. Radfar

Richard M. Dansereau

Wai-Yip Chan

چکیده

We present a new model-based monaural speech separation technique for separating two speech signals from a single recording of their mixture. This work is an attempt to solve a fundamental limitation in current model-based monaural speech separation techniques in which it is assumed that the data used in the training and test phases of the separation model have the same energy level. To overcome this limitation, a gain adapted minimum mean square error estimator is derived which estimates sources under different signalto-signal ratios. Specifically, the speakers’ gains are incorporated as unknown parameters into the separation model and then the estimator is derived in terms of the source distributions and the signal-to-signal ratio. Experimental results show that the proposed system improves the separation performance significantly when compared with a similar model without gain adaptation as well as a maximum likelihood estimator with gain estimation. A preliminary version of this paper was presented at the IEEE Workshop on Machine Learning for Signal Processing (MLSP) held in Thessaloniki, Greece in August 2007. M. H. Radfar (B) · R. M. Dansereau Department of Systems and Computer Engineering, Carleton University, Ottawa, Canada e-mail: [email protected] R. M. Dansereau e-mail: [email protected] M. H. Radfar · W.-Y. Chan Department of Electrical and Computer Engineering, Queen’s University, Kingston, Canada W.-Y. Chan e-mail: [email protected]

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Singing Voice Separation from Monaural Music Based on Kernel Back-Fitting Using Beta-Order Spectral Amplitude Estimation

Separating the leading singing voice from the musical background from a monaural recording is a challenging task that appears naturally in several music processing applications. Recently, kernel additive modeling with generalized spatial Wiener filtering (GW) was presented for music/voice separation. In this paper, an adaptive auditory filtering based on β-order minimum mean-square error spectr...

متن کامل

Speech separation based on the GMM PDF estimation

In this paper, the speech separation task will be regarded as a convolutive mixture Blind Source Separation (BSS) problem. The Maximum Entropy (ME) algorithm, the Minimum Mutual Information (MMI) algorithm and the Maximum Likelihood (ML) algorithm are main approaches of the algorithms solving the BSS problem. The relationship of these three algorithms has been analyzed in this paper. Based on t...

متن کامل

کاربرد الگوریتم جداسازی کور منابع در جداسازی سیگنال‌های گفتار و موسیقی

In this paper, the application of the Independent Component Analysis In this paper, the application of the Independent Component Analysis technique in speech-music separation is discussed. The separation algorithm is in the time domain. It needs the score function estimation to minimize the mutual information. For estimating score function, sufficient samples of the mixed (speech-music) signals...

متن کامل

Blind Speech Prior Estimation for Generalized Minimum Mean-Square Error Short-Time Spectral Amplitude Estimator

In this paper, to achieve high-quality speech enhancement, we introduce the generalized minimum mean-square error shorttime spectral amplitude estimator with a new blind prior estimation of the speech probability density function (p.d.f.). To deal with various types of speech signals with different p.d.f., we propose an algorithm of speech kurtosis estimation based on moment-cumulant transforma...

متن کامل

HMM-based channel error mitigation and its application to distributed speech recognition

The emergence of distributed speech recognition has generated the need to mitigate the degradations that the transmission channel introduces in the speech features used for recognition. This work proposes a hidden Markov model (HMM) framework from which different mitigation techniques oriented to wireless channels can be derived. First, we study the performance of two techniques based on the us...

متن کامل

ذخیره در منابع من

ذخیره در منابع من قبلا به منابع من ذحیره شده

{@ msg_add @}

با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

Signal Processing Systems

دوره 61 شماره

صفحات -

تاریخ انتشار 2010

Monaural Speech Separation Based on Gain Adapted Minimum Mean Square Error Estimation

نویسندگان

چکیده

منابع مشابه

Singing Voice Separation from Monaural Music Based on Kernel Back-Fitting Using Beta-Order Spectral Amplitude Estimation

Speech separation based on the GMM PDF estimation

کاربرد الگوریتم جداسازی کور منابع در جداسازی سیگنال‌های گفتار و موسیقی

Blind Speech Prior Estimation for Generalized Minimum Mean-Square Error Short-Time Spectral Amplitude Estimator

HMM-based channel error mitigation and its application to distributed speech recognition

عنوان ژورنال:

اشتراک گذاری